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Description 

DYNAMICALLY CONFIGURABLE FAULT 

TOLERANCE IN AUTONOMIC 
COMPUTING WITH MULTIPLE SERVICE 

POINTS 

Background of Invention 

[0001] This invention relates to autonomic computing, and more 
particularly to a method for clustering processors and as- 
signing service points in a system for efficient, fault- 
tolerant, self-configuring and self-healing operation. 

[0002] Autonomic computing, which generally refers to design of 
multiprocessor computing systems which are self- 
monitoring, self-configuring, fault-tolerant and self- 
healing, is atopic of considerable theoretical and practical 
interest. One important consideration in building a suc- 
cessful autonomic computing system is to embed the fault 
tolerance of the system within itself to enhance its self- 
healing mechanism. The self-healing mechanism would 



require that in case of a fault the system would immedi- 
ately detect the nature of the fault and try to correct the 
fault. In a case where it could not correct for the fault, the 
system would minimize the ensuing performance degra- 
dation by assigning the task of the faulty processor to one 
or more other processors. In a typical computer architec- 
ture, this task of fault detection and management is either 
done by one of the processors or by a master processor. 

[0003] The processors comprising an autonomic computing sys- 
tem may be distributed over a geographically large area. 
Furthermore, the processors may be of many different 
types, running many different types of operating systems, 
and connected by a distributed network. The various pro- 
cessors are often geographically arranged in clusters. 
Such an arrangement does not permit having one master 
processor managing the fault tolerance of the entire sys- 
tem. It is therefore advantageous to have some of the 
processors do the fault management. These processors 
will be referred to herein as service points. 

[0004] a typical system utilizing a service point is shown in Fig- 
ure 1A. The system 1 includes several interconnected pro- 
cessors 2, with one of those processors assigned to be the 
service point 10. Generally, the service point is chosen to 



be the processor having the smallest distance to the other 
processors. The term "distance" is used herein as a mea- 
sure of the communication time required between proces- 
sors. The service point has several tasks in addition to its 
own regular computing load: (1) detecting a faulty proces- 
sor elsewhere in the system; (2) replacing a faulty proces- 
sor by reassigning that processor's tasks to other proces- 
sors; (3) monitoring the tasks being performed by the 
other processors; and (4) balancing the load on the sys- 
tem to ensure optimum performance. Figure IB illustrates 
a situation where a processor 3 has had a fault detected 
by the service point 10, and has been removed from the 
system; the remainder of the system continues to operate. 
[0005] Though fault tolerance using redundant computation has 
been used for some time, the self-healing and self- 
configuring features of current autonomic computation 
systems raise several new concerns, for example: (1) The 
self-configurable and the self-adjustable features of an 
autonomic system work much better when all of the pro- 
cessors (including those distantly located) are uniform and 
interchangeable. This means that the service points 
should not be special processors but rather chosen from 
the same set of processors working an extra load. 



[0006] (2) Usually, in parallel and scalable computer architectures 
the number of service points is fixed and cannot be speci- 
fied as a fraction of the number of active processors. 
However, having too few service points causes the self- 
healing mechanism to be too slow; having too many ser- 
vice points degrades the overall performance of the sys- 
tem. 

[0007] (3) since the autonomic computing system works in a dy- 
namic environment, it is important to dynamically opti- 
mize clustering and service point assignment to optimize 
system performance. It should be noted that in an on- 
demand computing environment, the total number of pro- 
cessors (and thus the composition of clusters and assign- 
ment of service points) is constantly changing in response 
to the computing load. 

[0008] | n the self-configuring environment of an autonomic 

computing system, it generally is not possible to preas- 
sign the service points. Therefore, depending on the re- 
quirements of the situation any current processor can be 
dynamically assigned to be a service point. On the other 
hand, creating too many service points leads to a large 
computational load on the system. It is desirable, there- 
fore, to keep the number of service points limited to a 



certain fraction of the working processors. 

[0009] The current problem is, therefore: given a set of proces- 
sors in a distributed and dynamic environment, and a 
number representing the fractional value of the ratio of 
the maximum number of service points to the total num- 
ber of working processors, to determine the service points 
and the processors each service point would service. 

[0010] The idea of clustering has been successfully applied to 

many other fields. However, in all the above-noted appli- 
cations areas the number of clusters cannot be specified a 
priori. It is necessary to put an upper bound to the num- 
ber of clusters so that the overhead for extra service 
points is always bounded. The problem of clustering with 
a fixed limit is generally known as intractable: that is, an 
efficient optimal solution does not exist. There is still a 
need, however, for a solution that is efficient though sub- 
optimal. More particularly, there is a need for an efficient 
procedure for dynamically assigning the various proces- 
sors in a system to clusters, and for assigning service 
points within each cluster, to ensure optimum perfor- 
mance (including self-configuring and self-healing) of the 

system. 
Summary of Invention 



[0011] The present invention addresses the above-described 

need by providing a method is described for configuring a 
system having a plurality of processors to provide the 
system with at least one cluster of processors, where each 
cluster has one service point. According to the present in- 
vention, this is done by computing a distance from each 
processor to other processors in the system. A plurality of 
total distances is then computed, where each total dis- 
tance is associated with one processor. A minimum total 
distance is determined from the plurality of total dis- 
tances. One processor is assigned to be the service point; 
this processor is the processor having the minimum total 
distance associated therewith. 

[0012] According to another embodiment of the invention, the 
method further includes partitioning the system into a 
plurality of clusters. This partitioning process may include 
sorting the processors in accordance with the total dis- 
tance associated with each processor; assigning each pro- 
cessor to one of two clusters; determining a minimum to- 
tal distance for the processors in each cluster in accor- 
dance with the plurality of total distances associated with 
the processors in that cluster; and assigning as the service 
point for each cluster the processor having the minimum 



total distance associated therewith in said cluster. 
[0013] According to a further embodiment of the invention, one 
of two clusters is subdivided into two clusters, thereby 
partitioning the system into three clusters. A minimum 
total distance is determined for the processors in each of 
those three clusters in accordance with the plurality of to- 
tal distances associated with the processors in the three 
clusters. The processors are assigned to the three clusters 
in accordance with the minimum total distance; the pro- 
cessor having the minimum total distance associated 
therewith in each cluster is assigned as the service point 
for that cluster. 

[0014] According to an additional embodiment of the invention, 
the processors may be of different types, and the proces- 
sors are assigned to clusters in accordance therewith. 

[0015] The system may be configured dynamically when a pro- 
cessor is added to or removed from the system. Further- 
more the partitioning of the system may be dynamically 
changed when a processor is removed from the system. 

[0016] | n eacn cluster, another processor may be assigned as a 
backup service point, to assume the functions of the ser- 
vice point if the service point is removed from the system. 

[0017] | n accordance with another embodiment of the invention, 



a computer-readable storage medium is provided, having 
stored therein instructions for performing a method for 
configuring a system having a plurality of processors to 
provide the system with at least one cluster of processors, 
each cluster having one service point, This method com- 
prises the steps of computing a distance from each pro- 
cessor to other processors in the system; computing a 
plurality of total distances, where each total distance is 
associated with one processor; determining a minimum 
total distance from the plurality of total distances; and as- 
signing as the service point the processor having the min- 
imum total distance associated therewith. 
Brief Description of Drawings 

[0018] Figure 1A illustrates a typical arrangement of processors 
in a system having a service point. 

[0019] Figure IB illustrates the system of Figure 1A after a faulty 
processor is removed from the system. 

[0020] Figure 2 illustrates a system having multiple processors, 
with distances between the processors determined in ac- 
cordance with the present invention. 

[0021] Figure 3A illustrates a system having multiple processors 
which is to be configured into clusters with service points, 
in accordance with an embodiment of the invention. 



[0022] Figure 3B schematically illustrates the procedure for com- 
puting the total distance from one processor to other pro- 
cessors in a system, in accordance with the invention. 

[0023] Figure 3C is a flowchart detailing steps in algorithm(N,l) 
in accordance with an embodiment of the invention. 

[0024] Figure 3D schematically illustrates a system divided into 
two clusters. 

[0025] Figure 3E is a flowchart detailing steps in algorithm(N,2) 
in accordance with an embodiment of the invention. 

[0026] Figure 3F-1 schematically illustrates a system wherein a 
cluster is subdivided, in accordance with an embodiment 
of the invention. 

[0027] Figure 3F-2 schematically illustrates an alternative subdi- 
vision of clusters in a system. 

[0028] Figure 3G is a flowchart detailing steps in algorithm(N,p) 
in accordance with an embodiment of the invention. 

[0029] Figure 3H shows the result of a process for configuring 

clusters and assigning service points in the system of Fig- 
ure 3 A. 

[0030] Figure 4A illustrates a system having multiple processors 
of different types which is to be configured into clusters 
with service points, in accordance with a further embodi- 
ment of the invention. 



[0031] Figure 4B shows the result of a process for configuring 

clusters and assigning service points in the system of Fig- 
ure 4A. 

[0032] Figure 5A illustrates a system having multiple processors, 
where one processor is to be removed from the system. 

[0033] Figure 5B shows the system of Figure 5A, after a proces- 
sor has been removed. 

[0034] Figure 5C illustrates a system having multiple processors, 
where one processor is to be added to the system. 

[0035] Figure 5D illustrates a system having two clusters, where 
one processor is to be removed from the system. 

[0036] Figure 5E illustrates the system of Figure 5D after a pro- 
cessor has been removed, where the clusters have been 
reconfigured. 

[0037] Figure 6 illustrates a system having multiple processors 

configured into clusters each having a service point, 

where the service point of a cluster is to be removed from 

the system. 
Detailed Description 

[0038] | n the following description it will be assumed that the 

system has a set of N processors cl, c2, , cN (see Fig- 
ure 2). A distance function d(i j) is defined between pro- 
cessors ci and cj, so that d(i,i) = 0 and d(i,j) satisfies the 



triangular inequality: d(i j) + d(j,k) is greater than or equal 
to d(i,k). In terms of communication between processors, 
this means that communication through the j-th proces- 
sor must be slower than direct communication between 
the i-th and k-th processors. The distance d(i j) is a func- 
tion of the speed of the communication network, the rate 
of data exchange and data volume, and the geographic 
location of the processors. 
[0039] a fractional value f is defined as the maximum fraction of 
the N processors that can take the extra load of acting as 
service points. A number p = N * f then represents the 
maximum number of processors acting as service points. 
Accordingly, the system may have a maximum of p clus- 
ters. 

[0040] According to the invention, a method is provided for as- 
signing clusters and service points in a system of N pro- 
cessors having p clusters, in several cases of increasing 
complexity, as follows: 

[0041] c ase 1: Static system of uniform processors. 

[0042] The system of processors cl, c2, , cN is to be divided 

into p different clusters, where each cluster is identified 
with a service point so that the sum of the maximum dis- 
tance of all the processor belonging to a cluster to the 



corresponding service points of the cluster is minimized. 

[0043] For example, Figure 3A illustrates a system 30 having 

N=20 processors which is to be divided into p=4 clusters. 
In this example the fraction f is thus 1/5; in practice f is 
much smaller, generally in the range 1/50 to 1/100. 

[0044] The method will first be described for simple cases when 
p = 1 and p =2, that is, for 1 and 2 clusters. These two 
cases form the basis of the nontrivial case. 

[0045] For p=l, that is, for a single cluster, the following algo- 
rithm is used. 

[0046] Algorithm^, 1): 

[0047] For i = l to i = N do: 

[0048] set d_i = SUM {d(i,j)} for j = 1 to N and j not equal to i; 
[0° 49 ] Output i so that d_i is smallest for i = 1 to N. 

[0050] Figure 3B illustrates a system having N=10 processors 
where the distance d(l,j) is computed for processor cl. 
The sum of the distances d(l,2), d(l,3), ... , d(l,10) is the 
total distance for processor cl. 

[0051] Figure 3C is a flowchart illustrating algorithm (N,l). Be- 
ginning with the first processor (i = l in step 301), the dis- 
tance is computed between that processor and each of the 
other processors (step 302). The sum of these distances is 



the total distance for the i-th processor in the cluster 
(step 303). The processor whose total distance is the min- 
imum (step 304) would then be the service point for the 
cluster. 

[0052] For p = 2, that is, for two clusters, the following algorithm 
is used, in which the method of algorithm(N,l) is used it- 
eratively. As an initial step, the entire set of N processors 
is treated as a single cluster, and the foregoing algorithm 
(N,l) is used to compute a total distance for each proces- 
sor; that is, each of the N processors has a total distance 
representing the sum of distances from that processor to 
the other processors. This distance for the i-th processor 
is denoted as d(i,l). The N processors may then be sorted 
according to this distance. 

[0053] Algorithm^, 2): 

[0054] i sort the processors by distance d(i,l). 

[0055] 2. Initialize two clusters LI and L2, where LI contains all 
N processors and L2 contains 0 processors. 

[0056] 2.i Compute the point of service for LI and its smallest 
distance using algorithm(N,l); 

[0057] 2.ii Initialize d_0 to that distance. 

[0058] 3, scan the processors cl, c2, cN in their sorted order: 



[0059] 3.i: For each processor c_i, remove it from LI and put it in 
L2. 

[0060] 3 _ i i _ Compute the point of service for both LI and L2 and 

the smallest sum of distances d_li and d_2i; 
[° 061 ] 3.iii. Set the sum of distances d_i = d_li + d_2i. 

[0062] 4_ choose the distance di for i = 1 to N so that d_i is min- 
imum and output the clusters LI and L2 for that distance. 

[0063] Figure 3D is a schematic illustration of the system 30, 
with N = 10 processors, divided into two clusters LI and 
L2. Figure 3E shows a flowchart illustrating algo- 
rithm^^). As noted above, the total distance for each 
processor in the system is first computed using algo- 
rithm^,!) (step 321), and the processors are then sorted 
according to this distance (step 322). In the initial assign- 
ment of clusters, all 10 processors are in cluster LI, and 
no clusters are in cluster L2. The initial minimum total 
distance d_0 is thus the same as that previously computed 
using algorithm (N,l) (step 323). Each processor 
(proceeding in the sorted order by distance, as computed 
using algorithm(N,l), beginning with the smallest dis- 
tance) is successively moved from cluster LI to cluster L2 
(step 324), and the minimum total distance for each clus- 
ter is computed using algorithm(N,l) for each iteration 



(step 325). In the i-th iteration, the minimum distance is 
denoted d_li for cluster LI and d_2i for cluster L2. A 
combined minimum distance d_i = d_li + d_2i is then 
computed (step 326). In one of the N iterations, the value 
of d_i will be a minimum; this value is chosen as the total 
distance for the system, with the corresponding cluster 
assignments (step 327). 

[0064] For the non-trivial case of p clusters the above algo- 
rithm^, 2)is used iteratively. At each stage, each of the 
created clusters is divided into two parts using algo- 
rithm^, 2);the cluster assignments that minimize the total 
distance function are then identified. 

[0065] For example, the system of N=10 processors shown in 

Figure 3D is divided into two clusters LI and L2, as the al- 
gorithm^^) is performed. Then in the next stage, cluster 
LI is divided into two clusters Ll.l and LI. 2, while L2 is 
left intact (Figure 3F-1). The total distance (denoted d2.1) 
is then computed, as the sum of the cluster distances of 
Ll.l, L2.1 and L2. Then cluster L2 is divided into two 
clusters L2.1 and L2.2, with LI left intact (Figure3F-2). 
The total distance (now given as d2.2) is again computed, 
as the sum of the cluster distances of LI, L2.1 and L2.2. 
The smaller of d2.1 and d2.2 is chosen, along with the 



cluster configuration corresponding thereto. Thus if d2.1 
< d2.2, then the clusters Ll.l, LI. 2 and L2 are chosen. 
The subdivisions Ll.l and LI. 2 are renamed LI and L3 re- 
spectively, and the total distance for this set of clusters d3 
is set equal to d2.1. 

[0066] | n the same manner, at any iterative stage q + 1, where q = 
2, N-l, there are existing clusters LI, L_q. Then q 
sets of q + 1 clusters are created by dividing each of the 
clusters LI, L_q one at a time into two parts LJj.l} 
and L_{j . 2 } for j = 1, q while keeping all other clusters 

intact. The distance d_{q.l} {d_{q.q}} is then computed 

for each of these clusters. The smallest distance among 
these distances is chosen, along with the corresponding 
cluster configuration. Suppose that d_{m,j} = min {d_{q.l}, 
{d_{q.q}} }. Then the corresponding cluster {LI, L2, 

Lm.l, Lm.2, , Lq} is chosen. Then the distance 

d_{q + l} is set equal to d_{m,j}, Lm = Lm.l and L_{q + 1} = 
Lm.2. This process yields a set of q + 1 clusters, each with 
its own service point. 

[0067] T ne algorithm for p clusters is as follows: 

[0068] Algorithm (N,p): 

[0069] o. Divide the set of processors into two clusters by using 



algorithm(N,2). 
[0070] l. for i=2 to p do: 

[0071] i.i. for j = l to i do: 

[0072] 1.1.1 D i v ide Lj to Lj.l and Lj.2 using algorithm(N,2). 

[0073] 1.1.2 Sum the total distance of the clusters and call it 
d_{i.j}. 

[0074] 1.2 set d_{i + l} = min {d_{i.j} for j = 1 to i}. 

[0075] |_ e t the value of j for which the minimum occurs be j = m. 

[0076] 1.3 set Lm = Lm.l 

[0077] 1.4 set L_{i + 1} = Lm.22 

[0078] 2. Return the results LI, ... Lp 

[0079] Figure 3G shows a flowchart for the foregoing algo- 
rithm^, p). The system is first divided into two clusters 
using the algorithm(N,2) (step 351). An iteration is then 
performed where the j-th cluster is divided into two using 
algorithm(N,2) (step 352), and the total distance for all of 
the clusters (including the two subdivided clusters) is 
computed (step 353). The minimum of these total dis- 
tances is found (step 354), and the cluster configuration 
corresponding thereto is chosen (step 355). The clusters 



are then renamed as described above with reference to 
Figures 3F-1 and 3F-2 (step 356). The clusters are redi- 
vided until a set of p clusters is found (step 357). The sys- 
tem then has a set of p clusters, each with one service 
point. A result of this process is shown schematically in 
Figure 3H (compare Figure 3A), for N=20 processors and 
p=4 clusters. The system 30 is divided into four clusters 
31, 32, 33, 34 with respective service points 311, 312, 
313, 3 14. Case 2:Static system of non-uniform processors. 
[0080] Figure 4A is a schematic illustration of a system 40 where 
the processors are not all uniform, but are pregrouped 
according to their operating systems and/or technology 
and the service point for a type of processor must be of 
the same type. In system 40, the processors are of two 
types 41 and 42. It will be appreciated that the different 
types of processors may communicate with each other 
over a network running a processor-independent protocol 
such as TCP/IP. For example, an autonomic computing 
environment may be set up using a set of Windows-based 
processors running on Intel chips and a set of servers 
running AIX. The service point for the Intel systems must 
be an Intels ystem and the service point for the AIX sys- 
tem must be an AIX based system. The system 40 thus 



must be divided into clusters so that all processors in a 
cluster are of the same type. 

[0081] The solution for this case can be obtained from that of 
Case 1 discussed above. Suppose there are m different 
types of processors and the service point for each of these 
different types can be only from its own kind. In this case 
we first check if m < p, where p is the maximum number 
of clusters. If m > = p (which is very unlikely in a practical 
system), then the clusters are simply grouped according to 
the type of processors. For m < p, the clusters are initial- 
ized by grouping the processors into m clusters LI, Lm 
each containing only the same type of clusters. Then the 
algorithm(N,p) is applied to these m clusters. A result of 
this process is shown schematically in Figure 4B, with 
N=20, m=2 and p=4. Processors 41 are grouped into 
clusters 411, 412, 413, while processors 42 are grouped 
into cluster 420. 

[0082] c ase 3. Dynamic systemp; rocessors entering and leaving 
system. 

[0083] This case is an extension of Case 2 where the number of 
processors is dynamically changing. Processors may enter 
or leave the system, either because of faults or because 
they are operating in an on-demand environment. In ad- 



dition, a change in the number of processors can lead to a 
change in the number or arrangement of clusters, so that 
the clusters also are dynamically changing. Furthermore, if 
the processors are of more than one type (and thus in 
more than one distinct group), the number of groups may 
be dynamically changing. For example, Figure 5A shows a 
situation where processor 501 in cluster 50 is to be re- 
moved from the system. The cluster is reconfigured as 
cluster 51, as shown in Figure 5B. It is possible that the 
removal of processor 501 will result in a reassignment of 
the service point of cluster 51. 
[0084] The total distance between the processors in a cluster and 
the service point must be minimized dynamically to avoid 
loss in performance of the system. It is possible to recom- 
pute the clusters by using algorithm(N,p), each time one 
or more processors are added to the system or taken out 
of the system. This approach is very costly in terms of 
computing overhead and may cause significant perfor- 
mance degradation. Therefore, it is necessary to dynami- 
cally update the system for each of two cases: (i) adding a 
processor to the system, and (ii) removing a processor 
from the system. Procedures for these two cases are de- 
scribed below. It is assumed that an optimal set of clus- 



ters has been previously found using the algorithm(N,p). 

[0085] (j) a new processor is added to the system: In this case, 
we test the added processor against all the service points 
of all the existing clusters. Then the new processor is 
added to the cluster that minimizes the total distance. 
Then algorithm(N,l) is used at that cluster to which the 
new processor is added to find a new service point. Note 
that this procedure does not necessarily optimize the sys- 
tem but is very fast. This procedure is illustrated 
schematically in Figure 5C. In Figure 5C, system 55 in- 
cludes two clusters 551 and 552; it is desired to add new 
processor 560. Processor 560 will be added to cluster 
552, because the total distance of the system is thereby 
kept at a minimum. 

[0086] (jj) An existing processor is removed from the system: In 
this case, it is first determined from which cluster the 
processor is removed. There are two cases for this situa- 
tion. 

[0087] a ) |f t he removed processor is not the one furthest from 
the service point, then it does not affect the maximum 
distance for that cluster. In that case, the processor is re- 
moved and the new service point is recomputed by using 
algorithm(N,l) for this cluster. This situation is illustrated 



in Figure 5D, where system 57 includes clusters 571 and 
572, and processor 574 is to be removed. No changes to 
the cluster configuration occur as a result. 

[0088] b) If the removed processor is the one furthest from the 
service point then its removal does affect the maximum 
distance from the service point of the cluster. In that case, 
the processor is first removed from the cluster. In general, 
the system must then be rebalanced to maintain optimum 
performance. When a processor is removed from the cur- 
rent cluster, it may be possible to make the system more 
efficient by putting in another processor from another 
cluster to the current cluster. This situation is illustrated 
in Figure 5E, where processor 577 has been removed from 
the system (compare Figure 5D). As a result, new clusters 
578 and 579 are formed to minimize the total distance of 
the system. To accomplish this, we first note that at each 
step of the algorithm (N,p) to create the final set of clus- 
ters, a previously created cluster is divided using the al- 
gorithm^,?). Therefore the whole process of creating 
clusters can be expressed in a binary tree. All the final 
clusters are the leaf nodes of this binary tree. 

[0089] when a processor is removed from a cluster, we consider 
the other sibling node of the cluster from which the pro- 



cessor is removed. The processors in the sibling node are 
examined as to whether moving a processor from the sib- 
ling node to the current cluster will minimize the overall 
distance of the system. If the overall distance of the whole 
system is not affected, no action need be taken. However, 
if the overall distance is decreased by moving the proces- 
sor, a processor from the sibling node is removed and 
placed in the current cluster, so that the overall distance 
is again minimized. If the sibling node is a leaf node or an 
existing cluster, no further action need be taken. Other- 
wise, algorithm(n,2) is used to balance its children after 
removal of the processor. 

[0090] The adjustments described above are local in nature and 
can be performed very fast; they do not, however, globally 
balance the whole system. Therefore, after a number of 
dynamic adjustments, the system may be considerably off 
balance; it may be necessary to reoptimize the system us- 
ing algorithm(N,p) from time to time. 

[0091] c ase 4- Backup for service points. 

[0092] jo ensure fail-safe operation of the system, the present 
invention also includes a method for providing dual ser- 
vice points for each cluster, where a backup service point 
keeps am irror image of the information stored within the 



active service point and begins to function as the active 
service point in case of the failure of the service point 
during an update. It will be appreciated that the service 
points are no different from the regular processors of the 
system. Therefore, it is possible that a service point may 
have a fault and be removed from the system. In order to 
prevent system failure in this case, service information of 
the service point is stored in another processor in the 
same cluster. For example, system 60 shown in Figure 6 
includes four clusters61, 62, 63 and 64, where the service 
point of cluster 61 is processor 601, and processor 602 is 
the backup processor. In case of a failure in the active 
service point 601, the backup service point 602 assumes 
the functions of the service point. Algorithm(N,l) may 
then be used to rebalance the cluster if necessary. 
[0093] c ase 5; Dynamic number of service points. 

[0094] a more general problem is the situation where the maxi- 
mum allowable limit on the number of service points p is 
also changed. This can happen when a significant number 
of processors are either added to the system or taken out 
of the system at one time. This may also happen if the 
fractional value f (giving the maximum allowable number 
of service points p according to p = N * f ) is also 



changed. There are two cases to consider:(i)The new 
number of maximum allowable service points, pi, is 
greater than p. In this case, the algorithm(N,p) is used 
with the current number of clusters, but the algorithm is 
performed with the new number of clusters pi. This is 
similar to the solution described in Case 1, where in the 
first step each of the p clusters is divided once while oth- 
ers are left intact. The total sum of distances to the clos- 
est service point is computed in each case, and the cluster 
configuration is chosen that minimizes the total sum. This 
process is performed iteratively for p+1 pi. 

[0095] (jj) The new number of maximum allowable service points, 
p2 is less than p. In this event it is necessary to recall the 
order in which each of the p clusters were generated (see 
the above discussion of Case 1). The clusters are recom- 
bined in the reverse order until only p2 clusters remain. 

[0096] while the present invention has been described in terms 
of specific embodiments, it is evident in view of the fore- 
going description that numerous alternatives, modifica- 
tions and variations will be apparent to those skilled in the 
art. Accordingly, the invention is intended to encompass 
all such alternatives, modifications and variations which 
fall within the scope and spirit of the invention and the 



following claims. 
[0097] i claim: 



